NUCLEIC ACID LABELING COMPOUNDS



Statement of Rights to Inventions made under 
Federally Sponsored Research 

This invention was made with Government support under contract 
70NANB5H1031 awarded by the Advanced Technology Program of the 
National Institute of Standards and Technology. The Government has certain
rights in this invention. 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application claims priority from U.S. provisional patent 
Application Serial No. 60/275,202, filed on March 12, 2001, which is herein 
incorporated by reference for all purposes.

BACKGROUND OF THE INVENTION 

Gene expression in diseased and healthy individuals is often
different and characterizable. The ability to monitor gene expression in such
cases provides medical professionals with a powerful diagnostic tool. This form
of diagnosis is especially important in the area of oncology, where it is thought
that the overexpression of an oncogene, or the underexpression of a tumor
suppressor gene, results in tumorigenesis. See Mikkelson et al., J. Cell
Biochem. 1991, 46, 3-8.

One can indirectly monitor gene expression, for example, by
measuring a nucleic acid (e.g., mRNA) that is the transcription product of a
targeted gene. The nucleic acid is chemically or biochemically labeled with a
detectable moiety and allowed to hybridize with a localized nucleic acid probe of
known sequence. The detection of a labeled nucleic acid at the probe position
indicates that the targeted gene has been expressed. See, e.g., International
Application Publication Nos. WO 97/27317, WO 92/10588 and WO 97/10365.

The labeling of a nucleic acid is typically performed by covalently
attaching a detectable group (label) to either an internal or terminal position.
Scientists have reported a number of detectable nucleotide analogues that have
been enzymatically incorporated into an oligo- or polynucleotide. Langer et al.,
for example, disclosed analogues of dUTP and UTP that contain a covalently
bound biotin moiety. Proc. Natl. Acad. Sci. USA 1981, 78, 6633-6637. The
analogues, shown below, possess an allylamine linker arm that is attached to the
C-5 position of the pyrimidine ring. The dUTP and UTP analogues, wherein R
is H or OH, were incorporated into a polynucleotide.



Petrie et al. disclosed a dATP analogue, 3-[5-[(N-biotinyl-6-
aminocaproyl)-amino]pentyl]-1-(2-deoxy-β-D-erythro-pentofuranosyl)-1H-
pyrazolo[3,4-d]pyrimidin-4-amine-5'-triphosphate. Bioconjugate Chem. 1991,
2, 441-446. The analogue, shown below, is modified at the 3-position with a
linker arm that is attached to a biotin moiety. Petrie et al. reported that the
compound wherein R is biotin is incorporated into DNA by nick translation.




[Structure not reproduced: dATP analogue bearing the substituent R]




Prober et al. disclosed a set of four dideoxynucleotides, each
containing a succinylfluorescein dye. Science 1987, 238, 336-341. The
dideoxynucleotides, one of which is shown below, were enzymatically
incorporated into an oligonucleotide through a template-directed extension of a
primer. The compounds provided for a DNA sequencing method based on gel
migration.




[Structure not reproduced: dideoxynucleotide triphosphate (H4O9P3-O-) with a
-CH2C(O)N(CH3)-C(O)NHCH2- linkage]




Herrlein et al. disclosed modified nucleoside triphosphates of the
four DNA bases. Helv. Chim. Acta 1994, 77, 586-596. The compounds, one of
which is shown below, contain a 3'-amino group bearing radioactive or
fluorescent moieties. Herrlein et al. further described the use of the nucleoside
analogues as DNA chain terminators.




Cech et al. disclosed 3'-amino-functionalized nucleoside
triphosphates. Collect. Czech. Chem. Commun. 1996, 61, S297-S300. The
compounds, one of which is shown below, contain a fluorescein attached to the
3'-position through an amino linker. Cech et al. proposed that the described
functionalized nucleosides would be useful as terminators for DNA sequencing.

The development of novel nucleic acid labeling compounds that are
effectively incorporated into a nucleic acid to provide a readily detectable
composition would benefit genetic analysis technologies. It would aid, for
example, in the monitoring of gene expression and the detection and screening of
mutations and polymorphisms. Such a compound should be suitable for
enzymatic incorporation into a nucleic acid. Furthermore, the nucleic acid to
which the labeling compound is attached should maintain its ability to bind to a
probe, such as a complementary nucleic acid.

Although nucleic acid labeling compounds for use as coupling agents
for probes are available, there is a continuing need for additional compounds that
are more efficient labeling compounds. There also exists a need for compounds
that have increased solubility, which would make the compounds more useful for
monitoring gene expression.

SUMMARY OF THE INVENTION 

The present invention relates to nucleic acid labeling compounds.
More specifically, the invention provides compounds containing a detectable
moiety. The invention also provides methods of making these compounds. It
further provides methods of attaching the compounds to a nucleic acid. The
nucleic acid labeling compounds of the present invention are effectively
incorporated into nucleic acids to provide readily detectable compositions that
are useful for genetic analysis technologies. These compounds and the
detectable compositions can aid, for example, in the monitoring of gene
expression and the detection and screening of mutations and polymorphisms.
Thus, the compounds of the invention are suitable for enzymatic incorporation
into nucleic acids. Furthermore, the nucleic acids to which the labeling
compounds are attached maintain their ability to bind to a probe, such as, for
example, a complementary nucleic acid.

The present invention provides nucleic acid labeling compounds that 
are capable of being enzymatically incorporated into a nucleic acid. The nucleic 
acids to which the compounds are attached maintain their ability to bind to a 
complementary nucleic acid sequence. 

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 illustrates a non-limiting set of template moieties.

Figures 2-10 illustrate synthetic routes to fluorescein- and
biotin-labeled compounds of the invention.

DETAILED DESCRIPTION OF THE INVENTION 

The nucleic acid labeling compounds of the present invention have
the following structure:

A-T-Hc-L-(M)m-Q, or A-T(Hc)-L-(M)m-Q, or A-T-Q

wherein A is hydrogen or a functional group that permits the attachment of the
nucleic acid labeling compound to a nucleic acid; T is a template moiety; Hc is a
heterocyclic group; L is a linker moiety; each M is a connecting group; and Q is a
detectable moiety.
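
As a bookkeeping aid only, the linear forms of the general formula can be sketched in Python. The class and field names below are hypothetical and simply mirror the symbols A, T, Hc, L, M and Q; the sketch assembles a plain-text formula string, does not cover the branched form A-T(Hc)-L-(M)m-Q, and models no chemistry.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class LabelingCompound:
    """Hypothetical container mirroring the symbols of the general formula."""
    A: str                                      # hydrogen or an attachment group
    T: str                                      # template moiety
    Q: str                                      # detectable moiety
    Hc: Optional[str] = None                    # heterocyclic group, if present
    L: Optional[str] = None                     # linker moiety, if present
    M: List[str] = field(default_factory=list)  # the m connecting groups

    def formula(self) -> str:
        """Join the parts that are present, in the order A-T-Hc-L-(M)m-Q."""
        parts = [self.A, self.T, self.Hc, self.L, *self.M, self.Q]
        return "-".join(p for p in parts if p)


# Illustrative placeholder values only; they stand for the symbols themselves.
full = LabelingCompound(A="A", T="T", Hc="Hc", L="L", M=["M1", "M2"], Q="Q")
short = LabelingCompound(A="A", T="T", Q="Q")
print(full.formula())   # A-T-Hc-L-M1-M2-Q
print(short.formula())  # A-T-Q
```

Here m is simply the length of the list M, so m = 0 corresponds to an empty list.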

In one embodiment, the nucleic acid labeling compounds have the 
following structures: 



[Structures not reproduced, including those labeled IA and IB: each drawing
shows an NH2-substituted heterocycle bearing -L-(M)m-Q, attached to a ring that
contains X and carries A-O-, Y and Z]

wherein A is hydrogen or a functional group that permits the
attachment of the nucleic acid labeling compound to a nucleic acid; X is O, S,
NR1 or CHR2, wherein R1 and R2 are, independently, hydrogen, alkyl or aryl; Y
is hydrogen, N3, F, OR9, SR9 or NHR9, wherein R9 is hydrogen, alkyl or aryl; Z
is hydrogen, N3, F, OR9, SR9 or NHR10, wherein R10 is hydrogen, alkyl or aryl; L
is a linker moiety; Q is a detectable moiety; and M is a connecting group,
wherein m is an integer ranging from 0 to about 20.

In another embodiment, A is H or H4O9P3-; X is O; Y is H or OR9,
wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10, wherein R10
is hydrogen, alkyl or aryl; and L is amido alkyl and M is -(M1)a-(M2)b-, wherein a
and b are independently integers from 0 to about 5 and the sum of a and b is not
greater than 15.

In another embodiment, L is -(CH2)nC(O)NR3-, wherein R3 is
hydrogen, alkyl or aryl and n is an integer ranging from about 1 to about 10; M1
is -(CH2)iO- and M2 is -(CH2)jNH-, wherein i and j are independently integers
from 1 to about 5.

In another embodiment, Y is H or OH; Z is H or OH; L is
-CH2C(O)NH-; M1 is -(CH2CH2O)3- and M2 is -CH2CH2NH-; and Q is biotin or
a carboxyfluorescein.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH2C(O)NH-(CH2CH2O)3-CH2CH2NH-; and Q is biotin; or Y is OH; Z is OH;
-L-(M)m- is -CH2C(O)NH-(CH2CH2O)3-CH2CH2NH-; and Q is carboxyfluorescein.

In another embodiment, A is H or H4O9P3-; X is O; Y is H or OR9,
wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10, wherein R10
is hydrogen, alkyl or aryl; and L is amido alkyl and M is
-(M1)a-(M2)b-(M3)c-(M4)d-, wherein a, b, c, and d are independently integers from
0 to about 5 and the sum of a, b, c, and d is not greater than 15.
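
The count limits in this embodiment can be checked mechanically. The Python sketch below is illustrative only: the function names are hypothetical, and "about 5" is treated as a hard upper bound of 5, which is an assumption rather than something the text states. The example units are connecting groups recited elsewhere in this description.

```python
from typing import Sequence


def valid_counts(counts: Sequence[int], per_unit_max: int = 5,
                 total_max: int = 15) -> bool:
    """Check a, b, c, d: each in 0..per_unit_max (assumed hard bound), sum <= total_max."""
    in_range = all(isinstance(n, int) and 0 <= n <= per_unit_max for n in counts)
    return in_range and sum(counts) <= total_max


def expand_m(units: Sequence[str], counts: Sequence[int]) -> str:
    """Write out (M1)a-(M2)b-(M3)c-(M4)d as a hyphen-joined plain-text string."""
    if len(units) != len(counts) or not valid_counts(counts):
        raise ValueError("counts violate the stated limits")
    return "-".join(u for u, n in zip(units, counts) for _ in range(n))


# Plain-text forms of connecting groups; hyphens at the ends are omitted.
units = ["(CH2)2NH", "C(O)(CH2)2O", "(CH2CH2O)3", "(CH2)2NH"]
print(valid_counts((5, 5, 5, 0)))   # True: the sum is exactly 15
print(valid_counts((5, 5, 5, 5)))   # False: the sum is 20
print(expand_m(units, (1, 1, 1, 1)))
```

With four indices each allowed up to 5, the limit of 15 on the sum is a real restriction, which is why the check keeps the two conditions separate.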

In another embodiment, L is -(CH2)nC(O)NR3-, wherein R3 is
hydrogen, alkyl or aryl and n is an integer ranging from about 1 to about 10;
each M is independently -C(O)(CH2)kO-, -(CH2)iO- or -(CH2)jNH-, wherein i, j
and k are independently integers from 1 to about 5.

In another embodiment, L is -CH2C(O)NR3-, wherein R3 is hydrogen
or alkyl; M1 is -(CH2)2NH-, M2 is -C(O)(CH2)2O-, M3 is -(CH2CH2O)3- and M4
is -(CH2)2NH-.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH2C(O)NH-CH2CH2NH-C(O)CH2CH2O-(CH2CH2O)3-CH2CH2NH-; and Q is
biotin.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH2C(O)NH-CH2CH2NH-; m is 1; and Q is carboxyfluorescein.

In another embodiment, L is -(CH2)2C(O)NR3-, wherein R3 is
hydrogen or alkyl; M1 is -(CH2)2NH-, M2 is -C(O)(CH2)2O-, M3 is
-(CH2CH2O)3- and M4 is -(CH2)2NH-.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH2C(O)NH-CH2CH2NH-C(O)CH2CH2O-(CH2CH2O)3-CH2CH2NH-; and Q is
biotin.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH2CH2C(O)NH-CH2CH2NH-; m is 1; and Q is carboxyfluorescein.

In another embodiment, A is H or H4O9P3-; X is O; Y is H or OR9,
wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10, wherein R10
is hydrogen, alkyl or aryl; and L is functionalized alkyl and M is
-(M1)a-(M2)b-(M3)c-(M4)d-, wherein a, b, c, and d are independently integers from
0 to about 5 and the sum of a, b, c, and d is not greater than 15.

In another embodiment, L is -(CH2)nO-, wherein n is an integer
ranging from about 1 to about 10; each M is independently -C(O)(CH2)kO-,
-(CH2)iO- or -(CH2)jNH-, wherein i, j and k are independently integers from 1 to
about 5.

In another embodiment, L is -CH2O-; M1 is -(CH2)2NH-, M2 is
-C(O)(CH2)2O-, M3 is -(CH2CH2O)3- and M4 is -(CH2)2NH-.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH2O-CH2CH2NH-C(O)(CH2)2O-(CH2CH2O)3-CH2CH2NH-; and Q is biotin; or Y
is OH; Z is OH; -L-(M)m- is -CH2O-CH2CH2NH-; m is 1; and Q is
carboxyfluorescein.

In another embodiment, L is -(CH2)nNR3-, wherein R3 is hydrogen,
alkyl or aryl and n is an integer ranging from about 1 to about 10; each M is
independently -C(O)(CH2)kO-, -(CH2)iO- or -(CH2)jNH-, wherein i, j and k are
independently integers from 1 to about 5.

In another embodiment, L is -(CH2)6NH-; M1 is -(CH2)2NH-, M2 is
-C(O)(CH2)2O-, and M3 is -(CH2CH2O)3-. In another embodiment, Y is OH; Z
is OH; -L-(M)m- is -(CH2)6NH-C(O)(CH2)2O-(CH2CH2O)3-CH2CH2NH-; and Q
is biotin; or Y is OH; Z is OH; L is -(CH2)6NH-; m is 0; and Q is
carboxyfluorescein.

In another embodiment, A is H or H4O9P3-; X is O; Y is H or OR9,
wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10, wherein R10
is hydrogen, alkyl or aryl; and L is alkenyl alkyl and M is
-(M1)a-(M2)b-(M3)c-(M4)d-, wherein a, b, c, and d are independently integers from
0 to about 5 and the sum of a, b, c, and d is not greater than 15.

In another embodiment, L is of the structure
-CH=CH-(CH2)nC(O)N(R6)-, wherein R6 is hydrogen, alkyl or aryl and n is an
integer ranging from about 0 to about 10; each M is independently
-C(O)(CH2)kO-, -(CH2)iO- or -(CH2)jNH-, wherein i, j and k are independently
integers from 1 to about 5.

In another embodiment, L is -CH=CH-C(O)N(H)-; M1 is -(CH2)2NH-,
M2 is -C(O)(CH2)2O-, M3 is -(CH2CH2O)3- and M4 is -(CH2)2NH-.

In another embodiment, Y is OH; Z is OH; -L-(M)m- is
-CH=CH-C(O)NH-CH2CH2NH-C(O)(CH2)2O-(CH2CH2O)3-CH2CH2NH-; and Q is
biotin; or Y is OH; Z is OH; -L-(M)m- is -CH=CH-C(O)NH-CH2CH2NH-; m is
1; and Q is carboxyfluorescein.

In another embodiment, the nucleic acid labeling compounds have
the following structure:



[Structure not reproduced: the drawing carries the substituent -L-(M)m-Q]




wherein A is hydrogen or a functional group that permits the
attachment of the nucleic acid labeling compound to a nucleic acid; L is a linker
moiety; Q is a detectable moiety; and M is a connecting group, wherein m is an
integer ranging from 0 to about 3.

In another embodiment, A is H or H4O9P3-; L is alkanoyl; and Q is
biotin or a carboxyfluorescein; M is -NR3- where R3 is hydrogen or alkyl, and
wherein m is 1 or 0.

In another embodiment, L is -C(O)(CH2)n- wherein n is an integer
ranging from about 1 to about 10; M is -NH-; Q is biotin or a
carboxyfluorescein.

In another embodiment, -L-(M)m- is -C(O)(CH2)6-NH-; and Q is
biotin; or -L-(M)m- is -C(O)(CH2)6-NH-; and Q is carboxyfluorescein.

In another embodiment, the nucleic acid labeling compounds have
the following structures:



[Structures of formula III, including IIIB and IIIC, not reproduced: the
drawings show A-O- attached to a ring that contains X1 and carries Y1 and Z1]

wherein A is hydrogen or a functional group that permits the
attachment of the nucleic acid labeling compound to a nucleic acid; X1 is O, S,
NR1 or CHR2, wherein R1 and R2 are, independently, hydrogen, alkyl or aryl;
R15 is hydrogen, alkyl or aryl; Y1 is hydrogen, N3, F, OR9, SR9 or NHR9,
wherein R9 is hydrogen, alkyl or aryl; Z1 is hydrogen, N3, F, OR9, SR9 or NHR10,
wherein R10 is hydrogen, alkyl or aryl; wherein one of Y1 or Z1 is a group having
the formula -X2-L-(M)m-Q; X2 is O, S, or NR16, and R16 is hydrogen, alkyl or
aryl; L is a linker moiety; Q is a detectable moiety; and M is a connecting group,
wherein m is an integer ranging from 0 to about 20.

In another embodiment, A is H or H4O9P3-; X1 is O; Y1 is hydrogen,
OR9, wherein R9 is hydrogen or alkyl, or -NR16-(M)m-Q, wherein R16 is
hydrogen, alkyl or aryl; Z1 is hydrogen, OR10, wherein R10 is hydrogen or alkyl,
or -NR16-L-(M)m-Q, wherein R16 is hydrogen or alkyl; L is N-alkyl amido; R15
is hydrogen or alkyl; M is -(CH2)nNR3- where R3 is hydrogen or alkyl, and m is
from 1 to about 15.

In another embodiment, Y1 is hydrogen or OH; Z1 is -NH-L-(M)m-Q;
L is -(CH2)nNHC(O)- where n is an integer from 1 to about 10; M is
-NH(CH2)p- where p is an integer from 1 to about 10; R15 is hydrogen or
methyl; and Q is biotin or a carboxyfluorescein.

In another embodiment, Y1 is OH; R15 is hydrogen; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

In another embodiment, Y1 is OH; R15 is hydrogen; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

In another embodiment, Y1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

In another embodiment, Y1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

In another embodiment, Y1 is -NH-L-(M)m-Q; Z1 is hydrogen or OH;
L is -(CH2)nNHC(O)- where n is an integer from 1 to about 10; M is
-NH(CH2)p- where p is an integer from 1 to about 10; R15 is hydrogen or
methyl; and Q is biotin or a carboxyfluorescein.

In another embodiment, Z1 is OH; R15 is hydrogen; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

In another embodiment of the nucleic acid labeling compound, Z1 is
OH; R15 is hydrogen; -L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is a
carboxyfluorescein.

In another embodiment, Z1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

In another embodiment, Z1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

In another embodiment, A is H or H4O9P3-; X1 is O; Y1 is hydrogen,
OR9, wherein R9 is hydrogen or alkyl, or -O-L-(M)m-Q; Z1 is hydrogen, OR10,
wherein R10 is hydrogen or alkyl, or -O-L-(M)m-Q, or alkyl; L is alkylene; and
m is from 1 to about 10.

In another embodiment, Y1 is hydrogen or OH; Z1 is -O-L-(M)m-Q; L
is -(CH2)n- where n is an integer from 1 to about 12; M is -NH-; R15 is hydrogen
or methyl; and Q is biotin or a carboxyfluorescein.

In another embodiment, Y1 is OH; R15 is hydrogen; -L-(M)m- is
-(CH2)6-NH-; and Q is biotin.

In another embodiment, Y1 is OH; R15 is hydrogen; -L-(M)m-
is -(CH2)6-NH-; and Q is a carboxyfluorescein.

In another embodiment, Y1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)6-NH-; and Q is biotin.

In another embodiment, Y1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)6-NH-; and Q is a carboxyfluorescein.

In another embodiment, Y1 is -O-L-(M)m-Q; Z1 is hydrogen or OH; L
is -(CH2)n- where n is an integer from 1 to about 12; M is -NH-; R15 is hydrogen
or methyl; and Q is biotin or a carboxyfluorescein.

In another embodiment, Z1 is OH; R15 is hydrogen; -L-(M)m- is
-(CH2)6-NH-; and Q is biotin.

In another embodiment, Z1 is OH; R15 is hydrogen; -L-(M)m- is
-(CH2)6-NH-; and Q is a carboxyfluorescein.

In another embodiment, Z1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)6-NH-; and Q is biotin.

In another embodiment, Z1 is OH; R15 is methyl; -L-(M)m- is
-(CH2)6-NH-; and Q is a carboxyfluorescein.

In another embodiment, A is H or H4O9P3-; X1 is O; Y1 is hydrogen,
OR9, wherein R9 is hydrogen or alkyl, or -S-L-(M)m-Q; Z1 is hydrogen, OR10,
wherein R10 is hydrogen or alkyl, or -S-L-(M)m-Q; L is alkylene; X2 is S; and m
is from 1 to about 10.

In another embodiment, Y1 is hydrogen or OH; Z1 is -S-L-(M)m-Q; L
is -S-(CH2)n- where n is an integer from 1 to about 10; M is -NH-; R15 is
hydrogen or methyl; and Q is biotin or a carboxyfluorescein.

In another embodiment, Y1 is OH; R15 is hydrogen; -L-(M)m- is
-S-(CH2)2-NH-; and Q is biotin.

In another embodiment, Y1 is OH; R15 is hydrogen; -L-(M)m- is
-S-(CH2)2-NH-; and Q is a carboxyfluorescein.

In another embodiment, Y1 is OH; R15 is methyl; -L-(M)m- is
-S-(CH2)2-NH-; and Q is biotin.

In another embodiment, Y1 is OH; R15 is methyl; -L-(M)m- is
-S-(CH2)2-NH-; and Q is a carboxyfluorescein.

In another embodiment, Y1 is -S-L-(M)m-Q; Z1 is hydrogen or OH; L
is -S-(CH2)n- where n is an integer from 1 to about 10; M is -NH-; R15 is
hydrogen or methyl; and Q is biotin or a carboxyfluorescein.

In another embodiment, Z1 is OH; R15 is hydrogen; -L-(M)m- is
-S-(CH2)2-NH-; and Q is biotin.

In another embodiment, Z1 is OH; R15 is hydrogen; -L-(M)m- is
-S-(CH2)2-NH-; and Q is a carboxyfluorescein.

In another embodiment, Z1 is OH; R15 is methyl; -L-(M)m- is
-S-(CH2)2-NH-; and Q is biotin.

In another embodiment, Z1 is OH; R15 is methyl; -L-(M)m- is
-S-(CH2)2-NH-; and Q is a carboxyfluorescein.

In another embodiment, the nucleic acid labeling compounds have
the following structure:

[Structure IV not reproduced: A-O- attached to a ring containing X1, which
carries -X2-G-Q]

wherein A is hydrogen or a functional group that permits the
attachment of the nucleic acid labeling compound to a nucleic acid; X1 is O, S,
NR1 or CHR2, wherein R1 and R2 are, independently, hydrogen, alkyl or aryl; X2
is a bond or alkylene; Q is a detectable moiety; and G is -L-(M)m- where L is a
linker moiety and each M is a connecting group, where m is from 0 to about 20.

In another embodiment, A is H or H4O9P3-; X1 is O; X2 is a bond; and
G is -C(O)NR3-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NR3-, where R3 is hydrogen
or alkyl, and m and n are independently an integer from 1 to about 15.

In another embodiment, G is

-C(O)NH-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NH-

where m is from 1 to about 6 and n is from 1 to about 4. Preferably,
n is 3 or 4 and m is 5 or 6.
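
The preferred values of m and n give four plain-text variants of G. The Python sketch below writes them out, assuming the formula reads -C(O)NH-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NH-. The function name is hypothetical, and "about 6" and "about 4" are treated as hard limits, which is an assumption.

```python
def linker_g(m: int, n: int) -> str:
    """Return G = -C(O)NH-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NH- as plain text."""
    # "about 6" and "about 4" are treated as hard limits here (an assumption).
    if not (1 <= m <= 6 and 1 <= n <= 4):
        raise ValueError("m must be 1..6 and n must be 1..4")
    return f"-C(O)NH-(CH2){m}-(CH2CH2O){n}-CH2CH2-C(O)NH-"


# Preferred combinations named in the text: m is 5 or 6, n is 3 or 4.
preferred = [linker_g(m, n) for m in (5, 6) for n in (3, 4)]
for g in preferred:
    print(g)
```

The strings are notation only; they are not validated as chemical structures.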

In another embodiment, A is H or H4O9P3-; X1 is O; X2 is CH2; and G
is -C(O)NR3-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NR3-, where R3 is hydrogen or
alkyl, and m and n are independently an integer from 1 to about 15.

In another embodiment, A is H or H4O9P3-; X1 is O; X2 is CH2; and G
is -C(O)NR3-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NR3-, where R3 is hydrogen or
alkyl, and m and n are independently an integer from 1 to about 15.

The present invention also provides nucleic acid derivatives produced
by coupling a nucleic acid labeling compound with a nucleic acid and
hybridization products comprising the nucleic acid derivatives bound to a
complementary probe.

The present invention also provides nucleic acid derivatives produced
by coupling one of the nucleic acid labeling compounds of the invention with a
nucleic acid and the hybridization products comprising the nucleic acid
derivatives bound to a complementary probe. The hybridization product formed
from the nucleic acid derivatives comprises the nucleic acid derivative bound to a
complementary probe. In one embodiment, the probe is attached to a glass chip.

The present invention also provides methods of synthesizing nucleic
acid derivatives by attaching a nucleic acid labeling compound to a nucleic acid.
It further provides methods of detecting nucleic acids involving incubating the
nucleic acid derivatives with a probe.

In yet another embodiment, the methods involve the steps of: (a)
providing at least one nucleic acid coupled to a support; (b) providing a labeled
moiety capable of being coupled with a terminal transferase to said nucleic acid;
(c) providing said terminal transferase; and (d) coupling said labeled moiety to
said nucleic acid using said terminal transferase.

In still another embodiment, the methods involve the steps of: (a)
providing at least two nucleic acids coupled to a support; (b) increasing the
number of monomer units of said nucleic acids to form a common nucleic acid
tail on said at least two nucleic acids; (c) providing a labeled moiety capable of
recognizing said common nucleic acid tails; and (d) contacting said common
nucleic acid tails and said labeled moiety.

In still yet another embodiment, the methods involve the steps of: (a)
providing at least one nucleic acid coupled to a support; (b) providing a labeled
moiety capable of being coupled with a ligase to said nucleic acid; (c) providing
said ligase; and (d) coupling said labeled moiety to said nucleic acid using said
ligase.

This invention also provides compounds of the formulas described
herein.

Definitions 

The following definitions are used, unless otherwise described. Halo
is fluoro, chloro, bromo, or iodo. Alkyl, alkoxy, aralkyl, alkylaryl, etc. denote
both straight and branched alkyl groups; but reference to an individual radical
such as "propyl" embraces only the straight chain radical, a branched chain
isomer such as "isopropyl" being specifically referred to. Aryl includes a phenyl
radical or an ortho-fused bicyclic carbocyclic radical having about nine to ten
ring atoms in which at least one ring is aromatic. Heteroaryl encompasses a
radical attached via a ring carbon of a monocyclic aromatic ring containing five
or six ring atoms consisting of carbon and one to four heteroatoms each selected
from the group consisting of non-peroxide oxygen, sulfur, and N(X) wherein X
is absent or is H, O, (C1-C4)alkyl, phenyl or benzyl, as well as a radical of an
ortho-fused bicyclic heterocycle of about eight to ten ring atoms derived
therefrom, particularly a benz-derivative or one derived by fusing a propylene,
trimethylene, or tetramethylene diradical thereto.

"Alkyl" refers to a straight chain, branched or cyclic chemical 
group containing only carbon and hydrogen. Alkyl groups include, without
limitation, ethyl, propyl, butyl, pentyl, cyclopentyl and 2-methylbutyl. Alkyl
groups are unsubstituted or substituted with 1 or more substituents (e.g.,
halogen, alkoxy, amino).

"Alkylene" refers to a straight chain, branched or cyclic chemical
group containing only carbon and hydrogen. Alkylene groups include, without
limitation, ethylene, propylene, butylene, pentylene, and 2-methylbutylene.
Alkylene groups are unsubstituted or substituted with 1 or more substituents
(e.g., halogen, alkoxy, amino).

"Aryl" refers to a monovalent, unsaturated aromatic carbocyclic
group. Aryl groups include, without limitation, phenyl, naphthyl, anthryl and
biphenyl. Aryl groups are unsubstituted or substituted with 1 or more
substituents (e.g., halogen, alkoxy, amino). "Arylene" refers to a divalent aryl
group.

"Amido" refers to a chemical group having the structure -C(O)NR3-,
wherein R3 is hydrogen, alkyl or aryl. Preferably, the amido group is of the
structure -C(O)NR3- where R3 is hydrogen or alkyl having from about 1 to
about 6 carbon atoms. More preferably, the amido group is of the structure
-C(O)NH-.

"Alkanoyl" refers to a chemical group having the structure
-(CH2)nC(O)-, wherein n is an integer ranging from 0 to about 10. Preferably, the
alkanoyl group is of the structure -(CH2)nC(O)-, wherein n is an integer ranging
from about 2 to about 10. More preferably, the alkanoyl group is of the structure
-(CH2)nC(O)-, wherein n is an integer ranging from about 2 to about 6. Most
preferably, the alkanoyl group is of the structure -CH2C(O)-.

"Alkyl amido" refers to a chemical group having the structure 
-R4C(O)NR3-, wherein R3 is hydrogen, alkyl or aryl, and R4 is alkylene or
arylene. Preferably, the alkyl amido group is of the structure -(CH2)nC(O)NH-,
wherein n is an integer ranging from about 1 to about 10. More preferably, n is
an integer ranging from about 1 to about 6. Most preferably, the alkyl amido
group has the structure -(CH2)2C(O)NH- or the structure -CH2C(O)NH-.

"N-Amido alkyl" refers to a chemical group having the structure
-C(O)NR3R4-, wherein R3 is hydrogen, alkyl or aryl, and R4 is alkylene or
arylene. Preferably, the N-amido alkyl group is of the structure
-C(O)NH(CH2)nR5-, wherein n is an integer ranging from about 2 to about 10,
and R5 is O, NR6, or C(O), and wherein R6 is hydrogen, alkyl or aryl. More
preferably, the N-amido alkyl group is of the structure -C(O)NH(CH2)nN(H)-,
wherein n is an integer ranging from about 2 to about 6. Most preferably, the
N-amido alkyl group is of the structure -C(O)NH(CH2)4N(H)-.

"Alkynyl alkyl" refers to a chemical group having the structure
-C≡C-R4-, wherein R4 is alkyl or aryl. Preferably, the alkynyl alkyl group is of
the structure -C≡C-(CH2)nR5-, wherein n is an integer ranging from 1 to about
10, and R5 is O, NR6 or C(O), wherein R6 is hydrogen, alkyl or aryl. More
preferably, the alkynyl alkyl group is of the structure -C≡C-(CH2)nN(H)-,
wherein n is an integer ranging from 1 to about 4. Most preferably, the alkynyl
alkyl group is of the structure -C≡C-CH2N(H)-.

"Alkenyl alkyl" refers to a chemical group having the structure 
-CH=CH-R4-, wherein R4 is a bond, alkyl or aryl. Preferably, the alkenyl alkyl
group is of the structure -CH=CH-(CH2)nR5-, wherein n is an integer ranging
from 0 to about 10, and R5 is O, NR6, C(O) or C(O)NR6, wherein R6 is
hydrogen, alkyl or aryl. More preferably, the alkenyl alkyl group is of the
structure -CH=CH-(CH2)nC(O)NR6-, wherein n is an integer ranging from 0 to
about 4. Most preferably, the alkenyl alkyl group is of the structure
-CH=CH-C(O)N(H)-.

"Functionalized alkyl" refers to a chemical group of the structure
-(CH2)nR7-, wherein n is an integer ranging from 1 to about 10, and R7 is O, S,
NH or C(O). Preferably, the functionalized alkyl group is of the structure
-(CH2)nC(O)-, wherein n is an integer ranging from 1 to about 4. More
preferably, the functionalized alkyl group is of the structure -CH2C(O)-.

"Alkoxy" refers to a chemical group of the structure -O(CH2)nR8-,
wherein n is an integer ranging from 2 to about 10, and R8 is a bond, O, S, NH or
C(O). Preferably, the alkoxy group is of the structure -O(CH2)n-, wherein n is
an integer ranging from 2 to about 4. More preferably, the alkoxy group is of the
structure -OCH2CH2-.

"Alkyl thio" refers to a chemical group of the structure -S(CH2)nR8-,
wherein n is an integer ranging from 1 to about 10, and R8 is a bond, O, S, NH or
C(O). Preferably, the alkyl thio group is of the structure -S(CH2)n-, wherein n
is an integer ranging from 2 to about 4. More preferably, the alkyl thio group is
of the structure -SCH2CH2C(O)-.

"Amino alkyl" refers to a chemical group having an amino group
attached to an alkyl group. Preferably an amino alkyl is of the structure
-(CH2)nNH-, wherein n is an integer ranging from about 2 to about 10. More
preferably it is of the structure -(CH2)nNH-, wherein n is an integer ranging
from about 2 to about 4. Most preferably, the amino alkyl group is of the
structure -(CH2)2NH-.

"Nucleic acid" refers to a polymer comprising 2 or more nucleotides 
and includes single-, double- and triple-stranded polymers. "Nucleotide" refers
to both naturally occurring and non-naturally occurring compounds and
comprises a heterocyclic base, a sugar, and a linking group, preferably a
phosphate ester. For example, structural groups may be added to the ribosyl or
deoxyribosyl unit of the nucleotide, such as a methyl or allyl group at the 2'-O
position or a fluoro group that substitutes for the 2'-O group. The linking group,
such as a phosphodiester, of the nucleic acid may be substituted or modified, for
example with methyl phosphonates or O-methyl phosphates. Bases and sugars
can also be modified, as is known in the art. "Nucleic acid," for the purposes of
this disclosure, also includes "peptide nucleic acids" in which native or modified
nucleic acid bases are attached to a polyamide backbone.

The phrase "coupled to a support" means bound directly or indirectly 
thereto including attachment by covalent binding, hydrogen bonding, ionic 
interaction, hydrophobic interaction, or otherwise.

"Probe" refers to a nucleic acid that can be used to detect, by
hybridization, a target nucleic acid. Preferably, the probe is complementary to
the target nucleic acid along the entire length of the probe, but hybridization can
occur in the presence of one or more base mismatches between probe and target.

"Perfect match probe" refers to a probe that has a sequence that is 
perfectly complementary to a particular target sequence. The test probe is
typically perfectly complementary to a portion (subsequence) of the target
sequence. The perfect match (PM) probe can be a "test probe", a "normalization
control" probe, an expression level control probe and the like. A perfect match
control or perfect match probe is, however, distinguished from a "mismatch
control" or "mismatch probe." In the case of expression monitoring arrays,
perfect match probes are typically preselected (designed) to be complementary
to particular sequences or subsequences of target nucleic acids (e.g., particular
genes). In contrast, in generic difference screening arrays, the particular target
sequences are typically unknown. In the latter case, perfect match probes cannot
be preselected. The term perfect match probe in this context is to distinguish
that probe from a corresponding "mismatch control" that differs from the perfect
match in one or more particular preselected nucleotides as described below.

"Mismatch control" or "mismatch probe", in expression monitoring 
arrays, refers to probes whose sequence is deliberately selected not to be 
perfectly complementary to a particular target sequence. For each mismatch
(MM) control in a high-density array there preferably exists a corresponding
perfect match (PM) probe that is perfectly complementary to the same particular
target sequence. In "generic" (e.g., random, arbitrary, haphazard, etc.) arrays,
since the target nucleic acid(s) are unknown, perfect match and mismatch probes
cannot be a priori determined, designed, or selected. In this instance, the probes
are preferably provided as pairs where each pair of probes differ in one or more
preselected nucleotides. Thus, while it is not known a priori which of the probes
in the pair is the perfect match, it is known that when one probe specifically
hybridizes to a particular target sequence, the other probe of the pair will act as a
mismatch control for that target sequence. It will be appreciated that the perfect
match and mismatch probes need not be provided as pairs, but may be provided
as larger collections (e.g., 3, 4, 5, or more) of probes that differ from each other
in particular preselected nucleotides. While the mismatch(es) may be located
anywhere in the mismatch probe, terminal mismatches are less desirable as a
terminal mismatch is less likely to prevent hybridization of the target sequence.
In a particularly preferred embodiment, the mismatch is located at or near the
center of the probe such that the mismatch is most likely to destabilize the
duplex with the target sequence under the test hybridization conditions. In a
particularly preferred embodiment, perfect matches differ from mismatch
controls in a single centrally-located nucleotide.
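
By way of illustration only, the relationship between a perfect match probe and
a mismatch control that differs in a single centrally located nucleotide can be
sketched in Python. The function names, the DNA-only alphabet, and the choice
to replace the central base with its own complement are assumptions made for
this sketch; the disclosure does not prescribe which base is substituted.

```python
# Watson-Crick pairing for a DNA alphabet (assumption: DNA probes, uppercase).
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def perfect_match_probe(target: str) -> str:
    """Return the reverse complement of a target subsequence (the PM probe)."""
    return "".join(COMPLEMENT[base] for base in reversed(target.upper()))


def mismatch_probe(pm: str) -> str:
    """Return a copy of the PM probe whose central base is changed.

    Assumption: the central base is replaced by its complement; any other
    base different from the original would also yield a mismatch control.
    """
    mid = len(pm) // 2
    return pm[:mid] + COMPLEMENT[pm[mid]] + pm[mid + 1:]


# Hypothetical 25-nucleotide target subsequence, used only as an example.
target = "ACGTTGCAAGGCTTACGATCGGATC"
pm = perfect_match_probe(target)
mm = mismatch_probe(pm)
differing = [i for i, (a, b) in enumerate(zip(pm, mm)) if a != b]
```

For a 25-mer, `differing` contains only the central index, so the pair differs
in exactly one centrally located nucleotide.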

"Labeled moiety" refers to a moiety capable of being detected by the 
various methods discussed herein or known in the art.

Nucleic Acid Labeling Compounds 

The nucleic acid labeling compounds of the present invention are of
the following structure:

A-O-CH2-T-Hc-L-(M)m-Q, or A-O-CH2-T(Hc)-L-(M)m-Q, or A-O-CH2-T-Q

wherein A is hydrogen or a functional group that permits the
attachment of the nucleic acid labeling compound to a nucleic acid; T is an
optional template moiety; Hc is a heterocyclic group; L is a linker moiety; Q is
a detectable moiety; and M is a connecting group, wherein m is an integer
ranging from 0 to about 20. In a preferred embodiment m is from 0 to about 15.
In a more preferred embodiment m is from 0 to about 10. In the most preferred
embodiment m is from 0 to about 5.

The group A is either hydrogen or a functional group that permits the
attachment of a nucleic acid labeling compound to a nucleic acid. Nonlimiting
examples of such groups include the following: monophosphate; diphosphate;
triphosphate (H4O9P3); phosphoramidite ((R2N)(R'O)P), wherein R is linear,
branched or cyclic alkyl, and R' is a protecting group such as 2-cyanoethyl; and
H-phosphonate (HP(O)O-HNR3), wherein R is linear, branched or cyclic alkyl.
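
As an informal aid to reading the general structure A-O-CH2-T-Hc-L-(M)m-Q, the
components can be modeled as a small Python data type. This is only a sketch:
the class name, the field names, and the treatment of "about 20" as a hard
upper bound of 20 connecting groups are assumptions for illustration and are
not part of the disclosure.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class LabelingCompound:
    """Symbolic model of the general structure A-O-CH2-T-Hc-L-(M)m-Q."""

    a: str                      # hydrogen or attachment group, e.g. "triphosphate"
    hc: str                     # heterocyclic group
    linker: str                 # linker moiety L
    q: str                      # detectable moiety Q, e.g. "biotin"
    template: str = "T"         # template moiety
    connectors: List[str] = field(default_factory=list)  # the m connecting groups M

    def __post_init__(self) -> None:
        # Assumption: "about 20" is treated here as a hard cap of 20.
        if len(self.connectors) > 20:
            raise ValueError("m is expected to range from 0 to about 20")

    def formula(self) -> str:
        """Render the components in the order given by the general structure."""
        m_part = "".join(f"-{m}" for m in self.connectors)
        return (f"{self.a}-O-CH2-{self.template}-{self.hc}-"
                f"{self.linker}{m_part}-{self.q}")


example = LabelingCompound(a="H", hc="Hc", linker="L", q="Q",
                           connectors=["M", "M"])
rendered = example.formula()
```

With two connecting groups (m = 2), `rendered` lists A, the methylene ether,
the template, the heterocycle, the linker, both M groups, and Q in order.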

The template moiety (T) is covalently attached to a methylene group
(CH2) at one position and a heterocyclic group (Hc) or the linker moiety at
another position. A nonlimiting set of template moieties is shown in FIG. 1,
wherein the substituents are defined as follows: X is O, S, NR1 or CHR2; Y is H,
N3, F, OR9, SR9 or NHR9; Z is H, N3, F or OR10; W is O, S or CH2; D is O or S;
and G is O, NH or CH2. The substituents R1, R2, R9 and R10 are independent of
one another and are H, alkyl or aryl.

The heterocyclic group (Hc) is a cyclic moiety containing both carbon
and a heteroatom.

The linker moiety (L) of the nucleic acid labeling compound is
covalently bound to the heterocycle (Hc) or the template moiety at one terminal
position. It is attached to the detectable moiety (Q) at another terminal position,
either directly or through a connecting group (M). It is of a structure that is
sterically and electronically suitable for incorporation into a nucleic acid.
Nonlimiting examples of linker moieties include amido alkyl groups,
functionalized alkyl groups, alkenyl alkyl groups, alkanoyl groups, and N-alkyl
amido groups.

Amido groups have the structure -C(O)NR3-, wherein R3 is
hydrogen, alkyl or aryl. Preferably, the amido group is of the structure
-C(O)NR3-, where R3 is hydrogen or alkyl having from about 1 to about 6
carbon atoms. More preferably, the amido group is of the structure
-C(O)NH-.

Alkanoyl groups have the structure -(CH2)nC(O)-, wherein n is an integer
ranging from 0 to about 10. Preferably, the alkanoyl group is of the structure
-(CH2)nC(O)-, wherein n is an integer ranging from about 2 to about 10. More
preferably, the alkanoyl group is of the structure -(CH2)nC(O)-, wherein n is an
integer ranging from about 2 to about 6. Most preferably, the alkanoyl group is
of the structure -CH2C(O)-.

Amido alkyl groups have the structure -R4C(O)NR3-, wherein R3 is
hydrogen, alkyl or aryl, and R4 is alkyl or aryl. The amido alkyl group is
preferably of the structure -(CH2)nC(O)NH-, wherein n is an integer ranging
from about 1 to about 10. More preferably, n is an integer ranging from about 1
to about 6. Most preferably, the alkyl amido group has the structure
-(CH2)2C(O)NH- or the structure -CH2C(O)NH-.

N-Amido alkyl groups have the structure -C(O)NR3R4-, wherein R3
is hydrogen, alkyl or aryl, and R4 is alkylene or arylene. Preferably, the
N-amido alkyl group is of the structure -C(O)NH(CH2)nR5-, wherein n is an
integer ranging from about 2 to about 10, and R5 is O, NR6, or C(O), and
wherein R6 is hydrogen, alkyl or aryl. More preferably, the N-amido alkyl group
is of the structure -C(O)NH(CH2)nN(H)-, wherein n is an integer ranging from
about 2 to about 6. Most preferably, the N-amido alkyl group is of the structure
-C(O)NH(CH2)4N(H)-.

Alkenyl alkyl groups are of the structure -CH=CH-R4-, wherein R4
is alkyl or aryl. The alkenyl alkyl group is preferably of the structure
-CH=CH(CH2)nR5-, wherein n is an integer ranging from 1 to about 10, and R5
is O, NR6, C(O) or C(O)NR6, wherein R6 is hydrogen, alkyl or aryl. More
preferably, the alkenyl alkyl group is of the structure
-CH=CH-(CH2)nC(O)NR6-, wherein n is an integer ranging from 0 to about 4.
Most preferably, the alkenyl alkyl group is of the structure
-CH=CH-C(O)N(H)-.

Functionalized alkyl groups are of the structure -(CH2)nR7-, wherein
n is an integer ranging from 1 to about 10, and R7 is O, S, NH, or C(O). The
functionalized alkyl group is preferably of the structure -(CH2)nC(O)-, wherein
n is an integer ranging from 1 to about 4. More preferably, the functionalized
alkyl group is -CH2C(O)-.

Alkoxy groups are of the structure -O(CH2)nR8-, wherein n is an
integer ranging from 2 to about 10, and R8 is O, S, NH, or C(O). The alkoxy
group is preferably of the structure -O(CH2)n- or -O(CH2)nC(O)-, wherein n is
an integer ranging from 2 to about 4. More preferably, the alkoxy group is of the
structure -OCH2CH2- or -OCH2CH2C(O)-.

Alkyl thio groups are of the structure -S(CH2)nR8-, wherein n is an
integer ranging from 2 to about 10, and R8 is O, S, NH, or C(O). The alkyl thio
group is preferably of the structure -S(CH2)nC(O)-, wherein n is an integer
ranging from 2 to about 4. More preferably, the alkyl thio group is of the
structure -SCH2CH2C(O)-.

Amino alkyl groups comprise an amino group attached to an alkyl
group. Preferably, amino alkyl groups are of the structure -NH(CH2)nNH-,
wherein n is an integer ranging from about 2 to about 10. The amino alkyl group
is more preferably of the structure -(CH2)nNH-, wherein n is an integer ranging
from about 2 to about 4. Most preferably, the amino alkyl group is of the
structure -(CH2)4NH-.

The detectable moiety (Q) is a chemical group that provides a
signal. The signal is detectable by any suitable means, including spectroscopic,
photochemical, biochemical, immunochemical, electrical, optical or chemical
means. In certain cases, the signal is detectable by 2 or more means.

The detectable moiety provides the signal either directly or indirectly.
A direct signal is produced where the labeling group spontaneously emits a
signal, or generates a signal upon the introduction of a suitable stimulus.
Radiolabels, such as 3H, 125I, 35S, 14C or 32P, and magnetic particles, such as
Dynabeads™, are nonlimiting examples of groups that directly and
spontaneously provide a signal. Labeling groups that directly provide a signal in
the presence of a stimulus include the following nonlimiting examples: colloidal
gold (40 - 80 nm diameter), which scatters green light with high efficiency;
fluorescent labels, such as fluorescein, Texas red, rhodamine, and green
fluorescent protein (Molecular Probes, Eugene, Oregon), which absorb and
subsequently emit light; chemiluminescent or bioluminescent labels, such as
luminol, lophine, acridine salts and luciferins, which are electronically excited as
the result of a chemical or biological reaction and subsequently emit light; spin
labels, such as vanadium, copper, iron, manganese and nitroxide free radicals,
which are detected by electron spin resonance (ESR) spectroscopy; dyes, such as
quinoline dyes, triarylmethane dyes and acridine dyes, which absorb specific
wavelengths of light; and colored glass or plastic (e.g., polystyrene,
polypropylene, latex, etc.) beads. See U.S. Patent Nos. 3,817,837; 3,850,752;
3,939,350; 3,996,345; 4,277,437; 4,275,149 and 4,366,241.

A detectable moiety provides an indirect signal where it interacts 
with a second compound that spontaneously emits a signal, or generates a signal 
upon the introduction of a suitable stimulus. Biotin, for example, produces a 
signal by forming a conjugate with streptavidin, which is then detected. See 
Hybridization With Nucleic Acid Probes. In Laboratory Techniques in 
Biochemistry and Molecular Biology, Tijssen, P., Ed.; Elsevier: New York, 
1993; Vol. 24. An enzyme, such as horseradish peroxidase or alkaline 
phosphatase, that is attached to an antibody in a label-antibody-antibody as in an 
ELISA assay, also produces an indirect signal. 

A preferred detectable moiety is a fluorescent group. Fluorescent
groups typically produce a high signal-to-noise ratio, thereby providing increased
resolution and sensitivity in a detection procedure. Preferably, the fluorescent
group absorbs light with a wavelength above about 300 nm, more preferably
above about 350 nm, and most preferably above about 400 nm. The wavelength
of the light emitted by the fluorescent group is preferably above about 310 nm,
more preferably above about 360 nm, and most preferably above about 410 nm.

The fluorescent detectable moiety is selected from a variety of
structural classes, including the following nonlimiting examples: 1- and 2-
aminonaphthalene, p,p'-diaminostilbenes, pyrenes, quaternary phenanthridine
salts, 9-aminoacridines, p,p'-diaminobenzophenone imines, anthracenes,
oxacarbocyanine, merocyanine, 3-aminoequilenin, perylene, bisbenzoxazole,
bis-p-oxazolyl benzene, 1,2-benzophenazin, retinol, bis-3-aminopyridinium salts,
hellebrigenin, tetracycline, sterophenol, benzimidazolyl phenylamine, 2-oxo-3-
chromen, indole, xanthen, 7-hydroxycoumarin, phenoxazine, salicylate,
strophanthidin, porphyrins, triarylmethanes, flavin, xanthene dyes (e.g.,
fluorescein and rhodamine dyes); cyanine dyes; 4,4-difluoro-4-bora-3a,4a-diaza-
s-indacene dyes and fluorescent proteins (e.g., green fluorescent protein,
phycobiliprotein).

A number of fluorescent compounds are suitable for incorporation
into the present invention. Nonlimiting examples of such compounds include
the following: dansyl chloride; fluoresceins, such as 3,6-dihydroxy-9-
phenylxanthhydrol; rhodamine isothiocyanate; N-phenyl-1-amino-8-
sulfonatonaphthalene; N-phenyl-2-amino-6-sulfonatonaphthalene; 4-
acetamido-4-isothiocyanatostilbene-2,2'-disulfonic acid; pyrene-3-sulfonic acid;
2-toluidinonaphthalene-6-sulfonate; N-phenyl, N-methyl 2-aminonaphthalene-
6-sulfonate; ethidium bromide; stebrine; auromine-0,2-(9'-anthroyl)palmitate;
dansyl phosphatidylethanolamine; N,N'-dioctadecyl oxacarbocyanine; N,N'-
dihexyl oxacarbocyanine; merocyanine, 4-(3'-pyrenyl)butyrate; d-3-
aminodesoxy-equilenin; 12-(9'-anthroyl)stearate; 2-methylanthracene; 9-
vinylanthracene; 2,2'-(vinylene-p-phenylene)bisbenzoxazole; p-bis[2-(4-methyl-
5-phenyl oxazolyl)]benzene; 6-dimethylamino-1,2-benzophenazin; retinol; bis(3'-
aminopyridinium)-1,10-decandiyl diiodide; sulfonaphthylhydrazone of
hellibrienin; chlorotetracycline; N-(7-dimethylamino-4-methyl-2-oxo-3-
chromenyl)maleimide; N-[p-(2-benzimidazolyl)phenyl]maleimide; N-(4-
fluoranthyl)maleimide; bis(homovanillic acid); resazarin; 4-chloro-7-nitro-2,1,3-
benzooxadiazole; merocyanine 540; resorufin; rose bengal and 2,4-diphenyl-
3(2H)-furanone. Preferably, the fluorescent detectable moiety is a fluorescein or
rhodamine dye.

Another preferred detectable moiety is colloidal gold. The colloidal
gold particle is typically 40 to 80 nm in diameter. The colloidal gold may be
attached to a labeling compound in a variety of ways. In one embodiment, the
linker moiety of the nucleic acid labeling compound terminates in a thiol group
(-SH), and the thiol group is directly bound to colloidal gold through a dative
bond. See Mirkin et al. Nature 1996, 382, 607-609. In another embodiment, it
is attached indirectly, for instance through the interaction between colloidal gold
conjugates of antibiotin and a biotinylated labeling compound. The detection of
the gold-labeled compound may be enhanced through the use of a silver
enhancement method. See Danscher et al. J. Histotech 1993, 16, 201-207.

The connecting groups (M)m may serve to covalently attach the linker
group (L) to the detectable moiety (Q). Each M group can be the same or
different and can independently be any suitable structure that will not interfere
with the function of the labeling compound. Nonlimiting examples of M groups
include the following: amino alkyl, -CO(CH2)5NH-, -CO-, -NH-, -CO(O)-,
-CO(NH)-, -(CH2)iO-, -(CH2)jNH-, -C(O)(CH2)hO-,
-CO(CH2)5NHCO(CH2)5NH-, -NH(CH2CH2O)kNH-, and -CO(CH2)5-;
wherein k is an integer from 1 to about 5, preferably k is 1 or 2; m is an integer
ranging from 0 to about 5, preferably 0 to about 3; h, i and j are independently
integers from 1 to about 5, preferably 1 to about 3.

Synthesis of Nucleic Acid Labeling Compounds 

Figure 2 illustrates a synthetic route to nucleic acid labeling
compound 4, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as the trimethylsilyl ethers
according to the procedure described in J. Antibiot. 1977, 30, 129. The N1
nitrogen atom is alkylated using a substituted iodoamide having a biotin label
attached, according to the procedure described in Tetrahedron 1984, 40, 33.
(PEO-iodoacetyl biotin can be purchased from Pierce Chemical Co.) The silyl
protecting groups are removed using acetic acid in methanol, providing a
compound having free alcohol groups. The product is converted into a
5'-triphosphate to afford labeled compound 4.

Figure 3 illustrates a synthetic route to nucleic acid labeling
compounds 6 and 7, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as the trimethylsilyl ethers
according to the procedure described in J. Antibiot. 1977, 30, 129. The N1
nitrogen atom is alkylated according to the procedure described in Tetrahedron
1984, 40, 33, using ethyl iodoacetate. The ester is converted to amide, 5, using
ethylene diamine. The biotin labeled compound is prepared by reacting amide,
5, with biotin-dPEG4™-NHS (purchased from Quanta Biodesign).
Alternatively, amide 5 is reacted with 5-carboxyfluorescein-NHS. The silyl
protecting groups are removed using acetic acid in methanol to provide a
compound having free alcohol groups. The product is converted into a
5'-triphosphate using standard phosphorylation conditions to afford, respectively,
nucleic acid labeling compounds 6 and 7.

Figure 4 illustrates an alternate synthetic route to nucleic acid
labeling compounds 6 and 7, starting from 2-amino-5-(β-D-ribofuranosyl)-
4(1H)-pyrimidinone, 1. The hydroxy groups are protected as acetyl esters using
acetic anhydride in pyridine. The N1 nitrogen atom is alkylated according to the
procedure described in Tetrahedron 1984, 40, 33, using ethyl iodoacetate. The
ester is converted to amide, 8, using ethylene diamine. The biotin labeled
compound is prepared by reacting amide, 8, with biotin-dPEG4™-NHS
(purchased from Quanta Biodesign). Alternatively, amide 8 is reacted with 5-
carboxyfluorescein-NHS. The silyl protecting groups are removed to provide a
compound having free alcohol groups, using acetic acid in methanol. The
product is converted into a 5'-triphosphate using standard phosphorylation
conditions to afford, respectively, nucleic acid labeling compounds 6 and 7.

Figure 5 illustrates a synthetic route to nucleic acid labeling
compounds 10 and 11, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as acetyl esters using acetic
anhydride in pyridine. The N1 nitrogen atom is alkylated according to the
procedure described in K. Muhlegger, et al. 1996, WO 96/28640, using methyl
acrylate in triethyl amine. The ester, 8, is converted to amide, 9, using ethylene
diamine. The biotin labeled compound is prepared by reacting amide, 9, with
biotin-dPEG4™-NHS (purchased from Quanta Biodesign). Alternatively, amide
9 is reacted with 5-carboxyfluorescein-NHS. The product is converted into a
5'-triphosphate using standard phosphorylation conditions to afford, respectively,
nucleic acid labeling compounds 10 and 11.

Figure 6 illustrates a synthetic route to nucleic acid labeling
compounds 14 and 15, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as acetyl esters using acetic
anhydride in pyridine. The N1 nitrogen atom is alkylated according to the
procedure described in Tet. Lett. 1995, 36, 3261, using methyl propynoate. The
ester, 12, is converted to amide, 9, using ethylene diamine. The biotin labeled
compound is prepared by reacting amide, 9, with biotin-dPEG4™-NHS
(purchased from Quanta Biodesign). Alternatively, amide 9 is reacted with 5-
carboxyfluorescein-NHS. The product is converted into a 5'-triphosphate using
standard phosphorylation conditions to afford, respectively, nucleic acid labeling
compounds 14 and 15.

Figure 7 illustrates a synthetic route to nucleic acid labeling
compounds 17 and 18, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as the trimethylsilyl ethers
according to the procedure described in J. Antibiot. 1977, 30, 129. The N1
nitrogen atom is alkylated, using 2-(2-chloromethoxy-ethyl)-isoindole-1,3-dione,
according to the procedure described in Chemo 1985, 31, 151. The product, 16,
is treated with hydrazine in ethanol to remove the protecting groups. The biotin
labeled compound is prepared by reacting the ether-amine with biotin-dPEG4™-
NHS (purchased from Quanta Biodesign) using standard conditions.
Alternatively, the ether-amine is reacted with (Ac)2Fl-NHS using standard
conditions. The product is converted into a 5'-triphosphate using standard
phosphorylation conditions to afford, respectively, nucleic acid labeling
compounds 17 and 18.

Figure 8 illustrates a synthetic route to nucleic acid labeling
compounds 20 and 21, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as acetyl esters using acetic
anhydride in pyridine. The N1 nitrogen atom is alkylated using (6-trifluoro-
acetamido)hexyl alcohol in the presence of triphenyl phosphane ((Ph)3P), and
diethylazodicarboxylate (DEAD), according to the procedure described in
Brossette, T. et al., J. Org. Chem. 1999, 64, 5083 and Nucleosides, Nucleotides and
Nucleic Acids 2000, 19, 867. The N1 nitrogen atom is alkylated using (6-
trifluoroacetamido)hexyl alcohol according to the procedure described in J.
Antibiot. 1977, 30, 129. The protecting groups are removed to provide a
compound having free amine and alcohol groups, using ammonia in methanol.
The biotin labeled compound is prepared by reacting the ether-amine with
biotin-dPEG4™-NHS (purchased from Quanta Biodesign) using standard
conditions. Alternatively, the ether-amine is reacted with (Ac)2Fl-NHS using
standard conditions. The product is converted into a 5'-triphosphate using
standard phosphorylation conditions to afford, respectively, nucleic acid labeling
compounds 20 and 21.

Figure 9 illustrates a synthetic route to nucleic acid labeling
compounds 23 and 24, starting from 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 1. The hydroxy groups are protected as acetyl esters using acetic
anhydride in pyridine. The N1 nitrogen atom is alkylated using 2-(6-iodo-
hexyl)-isoindole-1,3-dione according to the procedure described in J. Antibiot.
1977, 30, 129. The product, 22, is treated with hydrazine in ethanol to remove
the protecting groups. The biotin labeled compound is prepared by reacting the
free amine with biotin-dPEG4™-NHS (purchased from Quanta Biodesign) using
standard conditions. Alternatively, the ether-amine is reacted with (Ac)2Fl-NHS
using standard conditions. The product is converted into a 5'-triphosphate using
standard phosphorylation conditions to afford, respectively, nucleic acid labeling
compounds 23 and 24.

Figure 10 illustrates a synthetic route to nucleic acid labeling
compound 36, starting from [1,4]Dioxan-2-one, Shahi, S.P., et al., J. Org.
Chem. (1999), 64: 4509-11, Nishimura, T., et al., J. Org. Chem. 1999, 64: 6750-
55, Nishida, H., et al., J. Polym. Sci. 2000, 38: 1560-67; (2-Hydroxy-ethoxy)-
acetic acid methyl ester, Kitano, M.; and Ohashi, N., EP 787728 A1 (1997), or a
polymer thereof, Nishida, H., et al., J. Polym. Sci. (2000), 38: 1560-67, 33. The
compound is reacted with a diamine having a suitable linker moiety, e.g., a
polyethylene oxide, alkylene, a combination thereof and the like. The labeled
compound is prepared by reacting the ether-amine with a suitable labeling
compound using standard conditions. The product is converted into a
5'-triphosphate using standard phosphorylation conditions to afford
nucleic acid labeling compound 36.

Nucleic Acid Labeling 

Nucleic acids can be isolated from a biological sample or
synthesized, on a solid support or in solution for example, according to methods
known to those of skill in the art. As used herein, there is no limitation on the
length or source of the nucleic acid used in a labeling process. Exemplary
methods of nucleic acid isolation and purification are described in Theory and
Nucleic Acid Preparation. In Laboratory Techniques in Biochemistry and
Molecular Biology: Hybridization With Nucleic Acid Probes; P. Tijssen, Ed.;
Part I; Elsevier: N.Y., 1993. A preferred method of isolation involves an acid
guanidinium-phenol-chloroform extraction followed by oligo dT column
chromatography or (dT)n magnetic bead use. Sambrook et al. Molecular
Cloning: A Laboratory Manual, 2nd ed.; Cold Spring Harbor Laboratory, 1989;
Vols. 1-3; and Current Protocols in Molecular Biology; F. Ausubel et al. Eds.;
Greene Publishing and Wiley Interscience: N.Y., 1987.

In certain cases, the nucleic acids are increased in quantity through
amplification. Suitable amplification methods include, but are not limited to, the
following examples: polymerase chain reaction (PCR) (Innis, et al. PCR
Protocols. A guide to Methods and Application; Academic Press: San Diego,
1990); ligase chain reaction (LCR) (Wu and Wallace. Genomics 1989, 4, 560;
Landegren, et al. Science 1988, 241, 1077; and Barringer, et al. Gene 1990, 89,
117); transcription amplification (Kwoh et al. Proc. Natl. Acad. Sci. USA 1989,
86, 1173); and self-sustained sequence replication (Guatelli, et al. Proc. Natl.
Acad. Sci. USA 1990, 87, 1874).

The nucleic acid labeling compound can be incorporated into a
nucleic acid using a number of methods. For example, it can be directly attached
to an original nucleic acid sample (e.g., mRNA, polyA mRNA, cDNA) or to an
amplification product. Methods of attaching a labeling compound to a nucleic
acid include, without limitation, nick translation, 3'-end-labeling, ligation, in
vitro transcription (IVT) or random priming. Where the nucleic acid is an RNA,
a labeled ribooligonucleotide is ligated, for example, using an RNA ligase such
as T4 RNA Ligase. In The Enzymes; Uhlenbeck and Greensport, Eds.; Vol. XV,
Part B, pp. 31-58; and, Sambrook et al., pp. 5.66-5.69. Terminal transferase is
used to add deoxy-, dideoxy- or ribonucleoside triphosphates (dNTPs, ddNTPs
or NTPs), for example, where the nucleic acid is single stranded DNA.

The labeling compound can also be incorporated at an internal
position of a nucleic acid. For example, PCR in the presence of a labeling
compound provides an internally labeled amplification product. See, e.g., Yu et
al. Nucleic Acids Research 1994, 22, 3226-3232. Similarly, IVT in the presence
of a labeling compound can provide an internally labeled nucleic acid.

Probe Hybridization

The nucleic acid to which the labeling compound is attached can be
detected after hybridization with a nucleic acid probe. Alternatively, the probe
can be labeled, depending upon the experimental scheme preferred by the user.
The probe is a nucleic acid, or a modified nucleic acid, that is either attached to a
solid support or is in solution. It is complementary in structure to the labeled
nucleic acid with which it hybridizes. The solid support is of any suitable
material, including polystyrene based beads and glass chips. In a preferred
embodiment, the probe or target nucleic acid is attached to a glass chip, such as a
GeneChip® product (Affymetrix, Inc., Santa Clara, CA). See International
Publication Nos. WO 97/10365, WO 97/29212, WO 97/27317, WO 95/11995,
WO 90/15070, and U.S. Pat. Nos. 5,744,305 and 5,445,934, which are hereby
incorporated by reference.

Because probe hybridization is often a step in the detection of a
nucleic acid, the nucleic acid labeling compound must be of a structure that does
not substantially interfere with that process. The steric and electronic nature of
the labeling compound, therefore, is compatible with the binding of the attached
nucleic acid to a complementary structure.



EXAMPLES

The following examples are offered to illustrate, but not to limit, the
present invention.

General Experimental Details 

Reagents were purchased from Aldrich Chemical Company
(Milwaukee, WI) in the highest available purity. All listed solvents were
anhydrous. Intermediates were characterized by 1H NMR and mass
spectrometry.
Example 1

Synthesis of fluorescein derivatives of 2'-amino-2'-deoxyuridine triphosphate
and 3'-amino-3'-deoxythymidine triphosphate (Scheme 1)

To 0.5 umol (50 uL of a 10 mM solution) of the amino nucleotide
triphosphate (1 or 2) in a 0.5 ml Eppendorf tube was added 25 uL of a 1 M aqueous
solution of sodium borate, pH 8, 87 uL of methanol, and 88 uL (10 umol, 20
equiv) of a 100 mM solution of 5-carboxyfluorescein-X-NHS ester in methanol.
The mixture was vortexed briefly and allowed to stand at room temperature in
the dark for 15 hours. The sample was then purified by ion-exchange HPLC to
afford the fluoresceinated derivatives 3 or 4 in about 78-84% yield. Relative
efficiencies of incorporation of these compounds by TdT are shown in Table 1.
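
The reagent quantities in this example can be cross-checked with a short calculation. The Python sketch below is illustrative only: the helper name `micromoles` and the variable names are hypothetical and do not appear in the source, and the total volume assumes the four listed liquids are simply additive. Computed from the stated volume and concentration, the ester charge comes to about 8.8 umol (roughly 18 equivalents), somewhat below the nominal 10 umol (20 equiv) quoted in the text.

```python
def micromoles(volume_ul: float, conc_mm: float) -> float:
    """Amount in umol from a volume in uL and a concentration in mM."""
    # 1 mM = 1 umol/mL = 0.001 umol/uL
    return volume_ul * conc_mm / 1000.0


# Values taken from the text of Example 1.
nucleotide_umol = micromoles(50, 10)    # amino nucleotide triphosphate
ester_umol = micromoles(88, 100)        # 5-carboxyfluorescein-X-NHS ester
equivalents = ester_umol / nucleotide_umol

# Assumption: volumes are additive (nucleotide + borate + methanol + ester).
total_volume_ul = 50 + 25 + 87 + 88
borate_final_mm = 25 * 1000 / total_volume_ul   # 25 uL of a 1 M (1000 mM) stock

print(f"nucleotide: {nucleotide_umol:.2f} umol")
print(f"NHS ester:  {ester_umol:.2f} umol ({equivalents:.1f} equiv)")
print(f"total volume: {total_volume_ul} uL, borate about {borate_final_mm:.0f} mM")
```

Under the additive-volume assumption the mixture totals 250 uL, which puts the borate buffer at about 100 mM in the reaction.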

Scheme 1

[Structure drawings not reproduced.]

99: X = OH, Y = NH2, Z = H
98: X = NH2, Y = H, Z = CH3
97: X = OH, Y = NHCO(CH2)5NHCOFL, Z = H
96: X = NHCO(CH2)5NHCOFL, Y = H, Z = CH3



Table 1.

Incorporation of triphosphate compounds by TdT.

[General structure not reproduced: a triphosphate bearing the substituents X, Y and B.]

TdT Labeling Efficiencies (% Labeled)

X (3')    Y (2')    B (1'b)    40 U     160 U
OH        H         uracil     100.0    100.0
Example 2

Synthesis of N1-labeled 2-Amino-5-(β-D-ribofuranosyl)-4(1H)-pyrimidinone

Synthesis of the N1-labeled 2-amino-5-(β-D-ribofuranosyl)-4(1H)-
pyrimidinone, 55, involved alkylation at N1 using conditions similar to those
described by Muehlegger, et al. (WO 96/28640) for the N1-alkylation of
pyrazolo-[4,3-d]pyrimidines (Figure 3A).

Example 3

Synthesis of N-(6-(fluorescein-5-carboxamido)hexanoyl)-morpholino uridine
triphosphate (Scheme 2).

Morpholino-uracil tosylate salt 25 (30 mg) was co-evaporated with
pyridine (3 x 3 ml), dissolved in 2 ml of pyridine and cooled to 0°C.
Trifluoroacetic anhydride (30 uL) was added and the mixture was stirred for 1 hour. The reaction
was followed by HPLC until complete. The pyridine was removed and the
residue was dissolved in 1 ml of water and filtered. The product was purified by
HPLC on a Waters C-18 Bondapak cartridge (Buffer: A = 50 mM TEM pH 7.0;
B = acetonitrile) using a gradient of 0-25% B in 30 minutes (retention time = 22
min.). The product was desalted on a Sep-Pak cartridge and freeze-dried to give
151 mg of 26. Phosphorylation of 26 using the POCl3 method gave 27. Removal
of the trifluoroacetyl group with conc. NH4OH at 50°C for 30 min
provided the free amine, 28. Conjugation of 28 to 5-carboxyfluorescein-
aminocaproic acid N-hydroxysuccinimide (Fl-X-NHS) under standard
conditions provided amide 29. The mass spectral and NMR data for compounds
25-29 were consistent with the proposed structures.

Scheme 2

[Structure drawings not reproduced.]

25 (tosylate salt) to 26 (N-trifluoroacetyl): (CF3CO)2O/pyridine
26 to 27 (triphosphate): phosphorylation, POCl3 method
27 to 28 (free amine): NH4OH
28 to 29 (N-C(O)(CH2)5NH-C(O)-Fl): Fl-X-NHS



Example 4

The compounds of the invention having general formulas I, IA, IB, II,
III, IIIA, IIIB, and IV as disclosed herein can be prepared using available
reagents and procedures that are standard in the art. In addition, examples of
these procedures are illustrated in Figures 2-10.

Example 5

Procedure for HPLC Analysis of Enzymatic Incorporation of Modified
Nucleotides.

Reaction Conditions

TdT

3 uM dT16 template
15 (30) uM NTP
40 U TdT (Promega)
1X buffer, pH 7.5 (Promega)

Procedure: incubate 1 hr. at 37 °C, then for 10 min. at 70 °C,
followed by the addition of EDTA (2 mM final concentration) in a volume of 50
uL.
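
For orientation, the listed concentrations can be converted to absolute amounts. The sketch below is a minimal illustration, assuming the stated 50 uL is the reaction volume; that reading, the helper name `picomoles` and all variable names are assumptions made here and are not stated in the source.

```python
REACTION_VOLUME_UL = 50.0  # assumption: the stated 50 uL is the reaction volume


def picomoles(conc_um: float, volume_ul: float) -> float:
    """Amount in pmol from a concentration in uM and a volume in uL."""
    # 1 uM = 1 pmol/uL
    return conc_um * volume_ul


template_pmol = picomoles(3, REACTION_VOLUME_UL)     # dT16 template
ntp_pmol_low = picomoles(15, REACTION_VOLUME_UL)     # NTP at 15 uM
ntp_pmol_high = picomoles(30, REACTION_VOLUME_UL)    # NTP at 30 uM

# Molar excess of nucleotide triphosphate over template 3' ends.
excess_low = ntp_pmol_low / template_pmol
excess_high = ntp_pmol_high / template_pmol

print(f"template: {template_pmol:.0f} pmol")
print(f"NTP: {ntp_pmol_low:.0f} pmol ({excess_low:.0f}-fold) "
      f"or {ntp_pmol_high:.0f} pmol ({excess_high:.0f}-fold)")
```

On this reading the triphosphate is present at a 5-fold (or 10-fold) molar excess over the template.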



HPLC Analysis

Materials and Reagents

4.6 mm x 250 mm Nucleopac PA-100 ion-exchange column (Dionex)

buffer A: 20 mM NaOH (or 20 mM Tris pH 8, in the case of TdT
incorporation of nucleotide triphosphates that are not dye-labeled)

buffer B: 20 mM NaOH, 1 M NaCl (or 20 mM Tris pH 8, 1 M NaCl, in the case of TdT
incorporation of nucleotide triphosphates that are not dye-labeled)

General Procedure

Dilute the reaction with 50 uL of buffer A. Inject 50 uL of this
sample onto the HPLC column and fractionate using a gradient of 5 to 100%
buffer B over 30 minutes at a flow rate of 1 mL/min. Detect the peaks
simultaneously at 260 nm absorbance and the absorbance maximum of the dye
(or the fluorescence emission maximum of the dye).
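
The gradient described above can be expressed as a simple function of time. The following sketch assumes the gradient is linear, which the text does not state explicitly; the function names `percent_b` and `nacl_molar` are hypothetical and serve only to illustrate how the eluent composition changes during the run.

```python
def percent_b(t_min: float, start: float = 5.0, end: float = 100.0,
              duration_min: float = 30.0) -> float:
    """Percent buffer B at time t_min, assuming a linear gradient."""
    if not 0.0 <= t_min <= duration_min:
        raise ValueError("time must lie within the gradient run")
    return start + (end - start) * t_min / duration_min


def nacl_molar(t_min: float, nacl_in_b: float = 1.0) -> float:
    """NaCl concentration (M) in the eluent; buffer B contains 1 M NaCl."""
    return nacl_in_b * percent_b(t_min) / 100.0


for t in (0, 15, 30):
    print(f"t = {t:2d} min: {percent_b(t):5.1f}% B, {nacl_molar(t):.3f} M NaCl")
```

Because buffers A and B differ only in their NaCl content, the salt concentration tracks the percentage of buffer B directly.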

The incorporation efficiency is expressed as the fraction of
oligonucleotide that is labeled. This number is determined by dividing the peak
area measured at 260 nm absorbance of the labeled oligonucleotide by the sum
of the peak areas of the unlabeled and labeled oligonucleotide. (The retention
time of fluorescein-labeled dT16 is on the order of 2 to 3 min. longer than the
unlabeled dT16.) The error in this type of assay is about 10%. The percentage
labeling efficiency for 4 types of nucleic acid labeling compounds is shown
below in Table 1.
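
The efficiency calculation described above amounts to a ratio of peak areas. The sketch below illustrates it in Python; the function name `labeling_efficiency` and the peak areas used in the example call are hypothetical values chosen for illustration, not data from the source.

```python
def labeling_efficiency(labeled_area: float, unlabeled_area: float) -> float:
    """Fraction of oligonucleotide that is labeled, from 260 nm peak areas."""
    total = labeled_area + unlabeled_area
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return labeled_area / total


# Hypothetical peak areas (arbitrary units), for illustration only.
fraction = labeling_efficiency(labeled_area=780.0, unlabeled_area=220.0)
print(f"labeling efficiency: {100 * fraction:.1f}%")
```

Given the stated assay error of about 10%, small differences between computed efficiencies should be interpreted with caution.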
Example 6

Labeled N-(2-hydroxyethoxy)ethyl 2-O-triphosphates (Scheme 3).

Compounds having general formula can be prepared using available
reagents and procedures that are standard in the art. In addition, examples of
these procedures are illustrated in Scheme 3 and Figure 10.

Scheme 3

[Structure drawings not reproduced.]

Reagent: NH-X-CO-NHS, where X = "linker" = -(CH2)n-; -(CH2CH2O)nCH2CH2-; etc.

1) phosphorylation
2) NH4OH
3) R-CO-NHS

R = detectable label

Example 6A

Labeled 2-(2-hydroxyethyl)acetamide 2-O-triphosphates (Figure 10).

[1,4]Dioxan-2-one, (2-Hydroxy-ethoxy)-acetic acid methyl ester, or a
polymer thereof, 33, is reacted with a diamine having a linker moiety, e.g., a
polyethylene oxide, alkylene, or a combination thereof. The labeled compound is
prepared by reacting the ether-amine with a suitable labeling compound using
standard conditions. The product is converted into a 5'-triphosphate using
standard phosphorylation conditions to afford, respectively, nucleic acid labeling
compound 36.
Example 7

Synthesis of N-alkyl 2'-amino-2'-deoxyuridine triphosphate (Scheme 4).

4,4-Diethoxy-butyl amine is reacted with an activated N-labeled
caproyl amine, 37, to provide the N-labeled diethoxy butyl caproyl amide, 38.
The amide, 38, is reacted with aqueous hydrochloric acid (0.04 N) to produce the
N-labeled aldehyde. The aldehyde is reacted with phosphorylated 1-(3-Amino-4-
hydroxy-5-hydroxymethyl-tetrahydro-furan-2-yl)-1H-pyrimidine-2,4-dione, 39,
to furnish labeled N-alkyl 2'-amino-2'-deoxyuridine triphosphate, 40. The
reaction is illustrated in Scheme 4.
Example 8

Synthesis of 2'-O-(6-(Fluorescein-5-carboxamido)hexyl)uridine 5'-O-
triphosphate (Scheme 5).

The protected phthalimide, 41 (available from RI Chemicals, 500
mg), is reacted successively with a) hydrazine/ethanol, b) Trifluoroacetic acid-
imidazole at 0°C and c) Acetic acid/methanol to provide compound 42. The
alcohol-amine is converted into a 5'-triphosphate using standard phosphorylation
conditions followed by conjugation to a label to afford, respectively, nucleic acid
labeling compounds, 43.
Scheme 5

1) hydrazine/ethanol
2) TFA-Im, 0°C
3) AcOH/methanol
Example 9

Synthesis of 2'-S-(N-(6-(Fluorescein-5-carboxamido)hexyl)-
aminoethyldithio)uridine 5'-O-triphosphate (Scheme 6).

3-Hydroxy-2-hydroxymethyl-2,3,3a,9a-tetrahydro-
furo[2',3':4,5]oxazolo[3,2-a]pyrimidin-6-one, 44, is treated with acetic anhydride
in pyridine to protect the hydroxy groups. The protected compound, 45, is
reacted with thioacetic acid/dioxane at 100°C, according to the procedure in
J. Chem. Soc. Perkin Trans. 1, 1997, 2587. The product is treated with ammonia
to provide thiol, 46. The thiol is reacted with disulfide, 99, to provide disulfide, 47.
The disulfide is converted into a 5'-triphosphate using standard phosphorylation
conditions followed by reaction with a labeled reagent to afford, respectively,
nucleic acid labeling compounds, 43.



Scheme 6

[Structure drawings not reproduced.]

44 to 45: Ac2O, pyridine
45 to 46: 1) AcSH, dioxane/100°C; 2) NH2OH
46 to 47: TFA-NHCH2CH2S-SO2Me (99)
47 to 48 or 49: 1) phosphorylation; 2) NH2OH; 3) Biotin-NHS or Fluorescein-NHS / pH 8

48: X is biotin
49: X is fluorescein



All patents, patent applications, and literature cited in the
specification are hereby incorporated by reference in their entirety. In the case
of any inconsistencies, the present disclosure, including any definitions therein,
will prevail.

The invention has been described with reference to various specific
and preferred embodiments and techniques. However, it should be understood
that many variations and modifications may be made while remaining within the
spirit and scope of the invention.
Claims

1. A nucleic acid labeling compound having the formula:

[Chemical structure not reproduced.]

wherein A is hydrogen or a functional group that permits the attachment of the
nucleic acid labeling compound to a nucleic acid;

X is O, S, NR1 or CHR2, wherein R1 and R2 are, independently, hydrogen, alkyl or
aryl;

Y is hydrogen, N3, F, OR9, SR9 or NHR9, wherein R9 is hydrogen, alkyl or aryl;
Z is hydrogen, N3, F, OR9, SR9 or NHR10, wherein R10 is hydrogen, alkyl or aryl;
L is a linker moiety; Q is a detectable moiety; and

M is a connecting group, wherein m is an integer ranging from 0 to about 20.

2. The nucleic acid labeling compound of claim 1, wherein A is H or H4O9P3-; X is O;
Y is H or OR9, wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10,
wherein R10 is hydrogen, alkyl or aryl; and L is amido alkyl and M is -(M1)a-(M2)b-
wherein a and b are independently integers from 0 to about 5 and the sum of a and b
is not greater than 15.

3. The nucleic acid labeling compound of claim 1 or 2, wherein L is -(CH2)nC(O)NR3-,
wherein R3 is hydrogen, alkyl or aryl and n is an integer ranging from about 1 to
about 10; M1 is -(CH2)iO- and M2 is -(CH2)jNH- wherein i and j are independently
integers from 1 to about 5.

4. The nucleic acid labeling compound of any of claims 1 to 3, wherein Y is H or OH;
Z is H or OH; -L is -CH2C(O)NH-; M1 is -(CH2CH2O)3- and M2 is -CH2CH2NH-;
and Q is biotin or a carboxyfluorescein.
5. The nucleic acid labeling compound of any of claims 1 to 4, wherein Y is OH; Z is
OH; -L-(M)m- is -CH2C(O)NH-(CH2CH2O)3-CH2CH2NH-; and Q is biotin.

6. The nucleic acid labeling compound of any of claims 1 to 4, wherein Y is OH; Z is
OH; -L-(M)m- is -CH2C(O)NH-(CH2CH2O)3-CH2CH2NH-; Q is carboxyfluorescein.

7. The nucleic acid labeling compound of claim 1, wherein A is H or H4O9P3-; X is O;
Y is H or OR9, wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10,
wherein R10 is hydrogen, alkyl or aryl; and L is amido alkyl and M is
-(M1)a-(M2)b-(M3)c-(M4)d- wherein a, b, c, and d are independently integers from 0 to
about 5 and the sum of a, b, c, and d is not greater than 15.

8. The nucleic acid labeling compound of claim 7, wherein L is -(CH2)nC(O)NR3-,
wherein R3 is hydrogen, alkyl or aryl and n is an integer ranging from about 1 to
about 10; each M is independently -C(O)(CH2)kO-, -(CH2)iO- or
-(CH2)jNH-, wherein i, j and k are independently integers from 1 to about 5.

9. The nucleic acid labeling compound of claim 7 or 8, wherein L is -CH2C(O)NR3-,
wherein R3 is hydrogen or alkyl; M1 is -(CH2)2NH-, M2 is -C(O)(CH2)2O-, M3 is
-(CH2CH2O)3- and M4 is -(CH2)2NH-.

10. The nucleic acid labeling compound of any of claims 7 to 9, wherein Y is OH; Z is
OH; -L-(M)m- is
-CH2C(O)NH-CH2CH2NH-C(O)CH2CH2O-(CH2CH2O)3-CH2CH2NH-; Q is biotin.

11. The nucleic acid labeling compound of any of claims 7 to 9, wherein Y is OH; Z is
OH; -L-(M)m- is -CH2C(O)NH-CH2CH2NH-; m is 1; and Q is carboxyfluorescein.

12. The nucleic acid labeling compound of claim 7 or 8, wherein L is -(CH2)2C(O)NR3-,
wherein R3 is hydrogen or alkyl; M1 is -(CH2)2NH-, M2 is -C(O)(CH2)2O-, M3 is
-(CH2CH2O)3- and M4 is -(CH2)2NH-.

13. The nucleic acid labeling compound of claims 7, 8, or 12, wherein Y is OH; Z is OH;
-L-(M)m- is -CH2C(O)NH-CH2CH2NH-C(O)CH2CH2O-(CH2CH2O)3-CH2CH2NH-;
Q is biotin.

14. The nucleic acid labeling compound of claims 7, 8, or 12, wherein Y is OH; Z is OH;
-L-(M)m- is -CH2CH2C(O)NH-CH2CH2NH-; m is 1; and Q is carboxyfluorescein.
15. The nucleic acid labeling compound of claim 1, wherein A is H or H4O9P3-; X is O;
Y is H or OR9, wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10,
wherein R10 is hydrogen, alkyl or aryl; and L is functionalized alkyl and M is
-(M1)a-(M2)b-(M3)c-(M4)d- wherein a, b, c, and d are independently integers from 0 to
about 5 and the sum of a, b, c, and d is not greater than 15.

16. The nucleic acid labeling compound of claim 15, wherein L is -(CH2)nO-, wherein n
is an integer ranging from about 1 to about 10; each M is independently
-C(O)(CH2)kO-, -(CH2)iO- or -(CH2)jNH-, wherein i, j and k are independently
integers from 1 to about 5.

17. The nucleic acid labeling compound of claim 15 or 16, wherein L is -CH2O-; M1 is
-(CH2)2NH-, M2 is -C(O)(CH2)2O-, M3 is -(CH2CH2O)3- and M4 is -(CH2)2NH-.

18. The nucleic acid labeling compound of claim 15, 16, or 17, wherein Y is OH; Z is OH;
-L-(M)m- is -CH2O-CH2CH2NH-C(O)(CH2)2O-(CH2CH2O)3-CH2CH2NH-; Q is
biotin.

19. The nucleic acid labeling compound of claim 17, wherein Y is OH; Z is OH;
-L-(M)m- is -CH2O-CH2CH2NH-; m is 1; and Q is carboxyfluorescein.

20. The nucleic acid labeling compound of claim 15, wherein L is -(CH2)nNR3-, wherein
R3 is hydrogen, alkyl or aryl and n is an integer ranging from about 1 to about 10;
each M is independently -C(O)(CH2)kO-, -(CH2)iO- or -(CH2)jNH- wherein i, j and
k are independently integers from 1 to about 5.

21. The nucleic acid labeling compound of claim 20, wherein L is -(CH2)6NH-; M1
is -(CH2)2NH-, M2 is -C(O)(CH2)2O-, and M3 is -(CH2CH2O)3-.

22. The nucleic acid labeling compound of claim 21, wherein Y is OH; Z is OH;
-L-(M)m- is -(CH2)6NH-C(O)(CH2)2O-(CH2CH2O)3-CH2CH2NH-; and Q is biotin.

23. The nucleic acid labeling compound of claim 21, wherein Y is OH; Z is OH; L is
-(CH2)6NH-; m is 0; and Q is carboxyfluorescein.

24. The nucleic acid labeling compound of claim 1, wherein A is H or H4O9P3-; X is O;
Y is H or OR9, wherein R9 is hydrogen, alkyl or aryl; Z is hydrogen, N3, F or OR10,
wherein R10 is hydrogen, alkyl or aryl; and L is alkenyl alkyl and M is
-(M1)a-(M2)b-(M3)c-(M4)d- wherein a, b, c, and d are independently integers from 0 to
about 5 and the sum of a, b, c, and d is not greater than 15.

25. The nucleic acid labeling compound of claim 7, wherein L has the structure
-CH=CH-(CH2)nC(O)N(R6)-, wherein R6 is hydrogen, alkyl or aryl and n is an
integer ranging from about 0 to about 10; each M is independently -C(O)(CH2)kO-,
-(CH2)iO- or -(CH2)jNH-, wherein i, j and k are independently integers from 1 to
about 5.

26. The nucleic acid labeling compound of claim 2, wherein L is -CH=CH-C(O)N(H)-;
M1 is -(CH2)2NH-, M2 is -C(O)(CH2)2O-, M3 is -(CH2CH2O)3- and M4 is
-(CH2)2NH-.

27. The nucleic acid labeling compound of claim 12, wherein -L-(M)m- is
-CH=CH-C(O)NH-CH2CH2NH-C(O)CH2CH2O-(CH2CH2O)3-CH2CH2NH-; Y is
OH; Z is OH; Q is biotin.

28. The nucleic acid labeling compound of claim 12, wherein Y is OH; Z is OH;
-L-(M)m- is -CH=CH-C(O)NH-CH2CH2NH-; m is 1; and Q is carboxyfluorescein.

29. The nucleic acid labeling compound of any of claims 1 to 28, having the formula:

[Chemical structure not reproduced; it bears the substituent -L-(M)m-Q.]

wherein A, X, Y, Z, L, M, m, and Q are as defined herein.
30. A nucleic acid labeling compound of any of claims 1 to 28, having the formula:

[Chemical structure not reproduced; it bears the substituent -L-(M)m-Q.]

wherein A, X, Y, Z, L, M, m, and Q are as defined herein.

31. A nucleic acid labeling compound having the formula:

[Chemical structure not reproduced; it bears the substituent -L-(M)m-Q.]

wherein A is hydrogen or a functional group that permits the attachment of the
nucleic acid labeling compound to a nucleic acid;

L is a linker moiety;

Q is a detectable moiety; and

M is a connecting group, wherein m is an integer ranging from 0 to about 3.

32. The nucleic acid labeling compound of claim 31, wherein A is H or H4O9P3-; L is
alkanoyl; and Q is biotin or a carboxyfluorescein; M is -NR3- where R3 is hydrogen
or alkyl, and wherein m is 1 or 0.

33. The nucleic acid labeling compound of claim 32, wherein L is -C(O)(CH2)n- wherein
n is an integer ranging from about 1 to about 10; M is -NH-; Q is biotin or a
carboxyfluorescein.

34. The nucleic acid labeling compound of claim 33, wherein -L-(M)m- is
-C(O)(CH2)6-NH-; and Q is biotin.

35. The nucleic acid labeling compound of claim 33, wherein -L-(M)m- is
-C(O)(CH2)6-NH-; and Q is carboxyfluorescein.

36. A nucleic acid labeling compound having the formula:

[Chemical structure not reproduced.]

wherein A is hydrogen or a functional group that permits the attachment of the
nucleic acid labeling compound to a nucleic acid;

X1 is O, S, NR1 or CHR2, wherein R1 and R2 are, independently, hydrogen, alkyl or
aryl;

R15 is hydrogen, alkyl or aryl;

Y1 is hydrogen, N3, F, OR9, SR9 or NHR9, wherein R9 is hydrogen, alkyl or aryl;
Z1 is hydrogen, N3, F, OR9, SR9 or NHR10, wherein R10 is hydrogen, alkyl or aryl;
wherein one of Y1 or Z1 is a group having the formula -X2L-(M)m-Q;
X2 is O, S, or NR16, and R16 is hydrogen, alkyl or aryl;
L is a linker moiety; Q is a detectable moiety; and

M is a connecting group, wherein m is an integer ranging from 0 to about 20.

37. The nucleic acid labeling compound of claim 36, wherein A is H or H4O9P3-; X1 is
O; Y1 is hydrogen, OR9, wherein R9 is hydrogen, or alkyl, or -NR16-L-(M)m-Q;
wherein R16 is hydrogen, alkyl or aryl; Z1 is hydrogen, OR10, wherein R10 is
hydrogen, or alkyl, or -NR16-L-(M)m-Q, wherein R16 is hydrogen, or alkyl; L is N-
alkyl amido; R15 is hydrogen or alkyl; M is -(CH2)nNR3- where R3 is hydrogen or
alkyl, and m is from 1 to about 15.

38. The nucleic acid labeling compound of claim 37, wherein Y1 is hydrogen or OH; Z1
is -NH-L-(M)m-Q; L is -(CH2)nNHC(O)- where n is an integer from 1 to about 10;
M is -NH(CH2)p- where p is an integer from 1 to about 10; R15 is hydrogen or
methyl; and Q is biotin or a carboxyfluorescein.
39. The nucleic acid labeling compound of claim 38, wherein Y1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

40. The nucleic acid labeling compound of claim 39, wherein Y1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

41. The nucleic acid labeling compound of claim 39, wherein Y1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

42. The nucleic acid labeling compound of claim 39, wherein Y1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

43. The nucleic acid labeling compound of claim 37, wherein Y1 is -NH-L-(M)m-Q; Z1 is
hydrogen or OH; L is -(CH2)nNHC(O)- where n is an integer from 1 to about 10; M
is -NH(CH2)p- where n is an integer from 1 to about 10; R15 is hydrogen or methyl;
and Q is biotin or a carboxyfluorescein.

44. The nucleic acid labeling compound of claim 43, wherein Z1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

45. The nucleic acid labeling compound of claim 43, wherein Z1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

46. The nucleic acid labeling compound of claim 43, wherein Z1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is biotin.

47. The nucleic acid labeling compound of claim 43, wherein Z1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)4NH-C(O)(CH2)5NH-; and Q is a carboxyfluorescein.

48. The nucleic acid labeling compound of claim 37, wherein A is H or H4O9P3-; X1 is
O; Y1 is hydrogen, OR9, wherein R9 is hydrogen, or alkyl, or -O-L-(M)m-Q; Z1 is
hydrogen, OR10, wherein R10 is hydrogen, or alkyl, or -O-L-(M)m-Q, or alkyl; L is
alkylene; and m is from 1 to about 10.

49. The nucleic acid labeling compound of claim 48, wherein Y1 is hydrogen or OH; Z1
is -O-L-(M)m-Q; L is -(CH2)n- where n is an integer from 1 to about 12; M is -NH-;
R15 is hydrogen or methyl; and Q is biotin or a carboxyfluorescein.

50. The nucleic acid labeling compound of claim 49, wherein Y1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)6-NH-; and Q is biotin.
51. The nucleic acid labeling compound of claim 49, wherein Y1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)6-NH-; and Q is a carboxyfluorescein.

52. The nucleic acid labeling compound of claim 49, wherein Y1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)6-NH-; and Q is biotin.

53. The nucleic acid labeling compound of claim 49, wherein Y1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)6-NH-; and Q is a carboxyfluorescein.

54. The nucleic acid labeling compound of claim 37, wherein Y1 is -O-L-(M)m-Q; Z1 is
hydrogen or OH; L is -(CH2)n- where n is an integer from 1 to about 12; M is -NH-;
R15 is hydrogen or methyl; and Q is biotin or a carboxyfluorescein.

55. The nucleic acid labeling compound of claim 54, wherein Z1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)6-NH-; and Q is biotin.

56. The nucleic acid labeling compound of claim 54, wherein Z1 is OH; R15 is hydrogen;
-L-(M)m- is -(CH2)6-NH-; and Q is a carboxyfluorescein.

57. The nucleic acid labeling compound of claim 54, wherein Z1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)6-NH-; and Q is biotin.

58. The nucleic acid labeling compound of claim 54, wherein Z1 is OH; R15 is methyl;
-L-(M)m- is -(CH2)6-NH-; and Q is a carboxyfluorescein.

59. The nucleic acid labeling compound of claim 37, wherein A is H or H4O9P3-; X1 is
O; Y1 is hydrogen, OR9, wherein R9 is hydrogen, or alkyl, or -S-L-(M)m-Q; Z1 is
hydrogen, OR10, wherein R10 is hydrogen, or alkyl, or -S-L-(M)m-Q; L is alkyl thio;
and m is from 1 to about 10.

60. The nucleic acid labeling compound of claim 59, wherein Y1 is hydrogen or OH; Z1
is -S-L-(M)m-Q; L is -S-(CH2)n- where n is an integer from 1 to about 10; M is
-NH-; R15 is hydrogen or methyl; and Q is biotin or a carboxyfluorescein.

61. The nucleic acid labeling compound of claim 60, wherein Y1 is OH; R15 is hydrogen;
-L-(M)m- is -S-(CH2)2-NH-; and Q is biotin.

62. The nucleic acid labeling compound of claim 60, wherein Y1 is OH; R15 is hydrogen;
-L-(M)m- is -S-(CH2)2-NH-; and Q is a carboxyfluorescein.






WO 02/072779 



PCT/US02/07584 



63. The nucleic acid labeling compound of claim 60, wherein Y1 is OH; R15 is methyl;
-L-(M)m- is -S-(CH2)2-NH-; and Q is biotin.

64. The nucleic acid labeling compound of claim 60, wherein Y1 is OH; R15 is methyl;
-L-(M)m- is -S-(CH2)2-NH-; and Q is a carboxyfluorescein.

65. The nucleic acid labeling compound of claim 37, wherein Y1 is -S-L-(M)m-Q; Z1 is
hydrogen or OH; L is -S-(CH2)n- where n is an integer from 1 to about 10; M is
-NH-; R15 is hydrogen or methyl; and Q is biotin or a carboxyfluorescein.

66. The nucleic acid labeling compound of claim 65, wherein Z1 is OH; R15 is hydrogen;
-L-(M)m- is -S-(CH2)2-NH-; and Q is biotin.

67. The nucleic acid labeling compound of claim 65, wherein Z1 is OH; R15 is hydrogen;
-L-(M)m- is -S-(CH2)2-NH-; and Q is a carboxyfluorescein.

68. The nucleic acid labeling compound of claim 65, wherein Z1 is OH; R15 is methyl;
-L-(M)m- is -S-(CH2)2-NH-; and Q is biotin.

69. The nucleic acid labeling compound of claim 65, wherein Z1 is OH; R15 is methyl;
-L-(M)m- is -S-(CH2)2-NH-; and Q is a carboxyfluorescein.

70. A nucleic acid labeling compound having the formula:

[Chemical structure not reproduced.]

wherein A is hydrogen or a functional group that permits the attachment of the
nucleic acid labeling compound to a nucleic acid;

X1 is O, S, NR1 or CHR2, wherein R1 and R2 are, independently, hydrogen, alkyl or
aryl; X2 is a bond or alkylene; Q is a detectable moiety; and G is a connecting group.

71. The nucleic acid labeling compound of claim 70, wherein A is H or H4O9P3-; X1 is
O; X2 is a bond; and G is -C(O)NR3-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NR3-,
where R3 is hydrogen or alkyl, and m and n are independently an integer from 1 to
about 15.
72. The nucleic acid labeling compound of claim 71, wherein G is
-C(O)NH-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NH- where m is from 1 to about 6
and n is from 1 to about 4.

73. The nucleic acid labeling compound of claim 70, wherein A is H or H4O9P3-; X1 is
O; X2 is CH2; and G is -C(O)NR3-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NR3-, where
R3 is hydrogen or alkyl, and m and n are independently an integer from 1 to about
15.

74. The nucleic acid labeling compound of claim 73, wherein G is
-C(O)NH-(CH2)m-(CH2CH2O)n-CH2CH2-C(O)NH- wherein m is from 1 to about 6
and n is from 1 to about 4.

75. A nucleic acid derivative produced by coupling a nucleic acid labeling compound of
claim 2, 7, 12, 15, 20, 24, 32, 37, or 71 with a nucleic acid.

76. A hybridization product, wherein the hybridization product comprises the nucleic
acid derivative of claim 75 bound to a complementary probe.

77. The hybridization product of claim 76, wherein the probe is attached to a glass chip.

78. A hybridization product, wherein the hybridization product comprises the nucleic
acid derivative of claim 75 bound to a complementary probe.

79. A method of synthesizing a labeled nucleic acid comprising attaching a nucleic acid
labeling compound of claim 2, 7, 12, 15, 20, 24, 32, 37, or 71 to a nucleic acid.

80. A method of detecting a nucleic acid comprising incubating a nucleic acid derivative
according to claim 75 with a probe.

81. A method according to claim 80, wherein the probe is attached to a glass chip.
Fig. 3A